Musical Instrument Recognition using Biologically Inspired Filtering of Temporal Dictionary Atoms
نویسندگان
چکیده
Most musical instrument recognition systems rely entirely upon spectral information instead of temporal information. In this paper, we test the hypothesis that temporal information can improve upon the accuracy achievable by the state of the art in instrument recognition. Unlike existing temporal classification methods which use traditional features such as temporal moments, we extract novel features from temporal atoms generated by nonnegative matrix factorization by using a multiresolution gamma filterbank. Among isolated sounds taken from twenty-four instrument classes, the proposed system can achieve 92.3% accuracy, thus improving upon the state of the art.
منابع مشابه
Title of dissertation : SPARSE AND NONNEGATIVE FACTORIZATIONS FOR MUSIC UNDERSTANDING
Title of dissertation: SPARSE AND NONNEGATIVE FACTORIZATIONS FOR MUSIC UNDERSTANDING Steven Kiemyang Tjoa, Doctor of Philosophy, 2011 Dissertation directed by: Professor K. J. Ray Liu Department of Electrical and Computer Engineering In this dissertation, we propose methods for sparse and nonnegative factorization that are specifically suited for analyzing musical signals. First, we discuss two...
متن کاملBioinspired sparse spectro-temporal representation of speech for robust classification
In this work, a first approach to a robust phoneme recognition task by means of a biologically-inspired feature extraction method is presented. The proposed technique provides an approximation to the speech signal representation at the auditory cortical level. It is based on an optimal dictionary of atoms, estimated from auditory spectrograms, and the Matching Pursuit algorithm to approximate t...
متن کاملRobust method for finding sparse solutions to linear inverse problems using an L2 regularization
We analyzed the performance of a biologically inspired algorithm called the Corrected Projections Algorithm (CPA) when a sparseness constraint is required to unambiguously reconstruct an observed signal using atoms from an overcomplete dictionary. By changing the geometry of the estimation problem, CPA gives an analytical expression for a binary variable that indicates the presence or absence o...
متن کاملTimbre Recognition with Combined Stationary and Temporal Features
In this paper we consider the problem of modeling spectro-temporal behaviour of musical sounds, with applications for musical instrument recognition. Using instanteneous sound features, such as cepstral envelopes and cepstral derivatives, the temporal evolution of the sound is transcribed into a new representation as a sequence of spectral features. Applying information-theoretic sequence match...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کامل